Model Selection

Efficient Attention Mechanism

# Efficient Attention Mechanism

Seerattention Decode Qwen3 4B AttnGates

Provide the AttnGate weights for the decoding phase in the SeerAttention-R paper, supporting the inference tasks of the Qwen3-4B model

Large Language Model

Modernbert Base Squad2 V0.2

QA model fine-tuned from ModernBERT-base-nli, supporting long-context processing

Question Answering System

Mistral 7B Instruct V0.2 Sparsity 30 V0.1

Mistral-7B-Instruct-v0.2 is an enhanced instruction fine-tuned large language model based on Mistral-7B-Instruct-v0.1, achieving 30% sparsity through Wanda pruning method without requiring retraining while maintaining competitive performance.

Large Language Model

Nystromformer 4096

Long-sequence Nyströmformer model trained on WikiText-103 v1 dataset, supports sequence processing up to 4096 tokens

Large Language Model

Nystromformer 2048

Nystromformer model trained on the WikiText-103 dataset, supporting long sequence processing (2048 tokens)

Large Language Model

Long T5 Tglobal Base

LongT5 is a text-to-text transformation model based on the T5 architecture, employing transient global attention mechanism for efficient processing of long sequence inputs

Large Language Model English

Deit Tiny Distilled Patch16 224

This model is a distilled version of the Data-efficient image Transformer (DeiT), pretrained and fine-tuned on ImageNet-1k at 224x224 resolution, efficiently learning from a teacher model through distillation.

Image Classification

Bart Base Cnn R2 18.7 D23 Hybrid

This is a pruned and optimized BART-base model, specifically fine-tuned on the CNN/DailyMail dataset for summarization tasks.

Text Generation

Transformers English

Chinese Bigbird Mini 1024

This is a Chinese pre-trained model based on the BigBird architecture, optimized for Chinese text processing and supporting long text sequence handling.

Large Language Model

Transformers Chinese

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase